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CRYSTALLIZATION OF 
AURORA/LPLIP-RELATED KINASE 

RELATED APPLICATION 

[001] This application claims the benefit of U.S. Provisional Application No. 

60/390,355, filed June 21, 2002, which is incorporated herein by reference. 

FIELD OF THE INVENTION 

[002] The present invention relates to Aurora/LPLlF-related kinase ("AIK") and more 

specifically to AIK in crystalline form, methods of forming crystals comprising AIK, methods 
of using crystals comprising ADC, a crystal structure of AIK, and methods of using the crystal 
structure. 

BACKGROUND OF THE INVENTION 

[003] The most dangerous forms of cancer comprise malignant cells that metastasize 

to distant sites in a body. Metastatic cells have the property of being able to break away fi'om a 
primary tumor, translocate to distant sites, and colonize distant and foreign microenvironments. 
Cancer cell metastasis requires cellular capacity to 1) detach fi'om a primary tumor, 2) migrate 
and invade through local tissues, 3) translocate to distant sites in the body (via lymph or 
blood), 4) colonize a foreign site, and 5) grow and survive in this foreign environment. All of 
these behaviors are linked to cell adhesions. 

[004] Cell adhesions control the physical interactions of cells with their 

microenvironment. Cell adhesions also initiate signals that dictate tumor cell growth, death, 
and differentiation. At the cellular level, metastatic cells have overcome restraints upon cell 
growth and migration that result from physical linkages and signals conveyed by cell-cell 
contacts. Malignant cells often have increased interactions with surroimding extracellular 
matrix (ECM) proteins that provide linkages and signals that promote several aspects of 
metastasis. 

[005] A general approach to designing inhibitors that are selective for a given protein 

is to determine how a putative inhibitor interacts with a three dimensional structure of that 
protein. For this reason it is usefiil to obtain the protein in crystalline form and perform X-ray 
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diffraction techniques to determine the protein's three dimensional structure coordinates. 
Various methods for preparing crystalline proteins are known in the art. 
[006] Once protein crystals are produced, crystallographic data can be generated using 

the crystals to provide useftil structural information that assists the design of small molecules 
that bind to the active site of the protein and inhibit the protein's activity in vivo. If the protein 
is crystallized as a complex with a ligand, one can determine both the shape of the protein's 
binding pocket when boimd to the ligand, as well as the amino acid residues that are capable of 
close contact with the ligand. By knowing the shape and amino acid residues in the binding 
pocket, one may design new ligands that will interact favorably with the protein. With such 
structural information, available computational methods may be used to predict how strong the 
ligand binding interaction will be. Such methods help in the design of inhibitors that bind 
strongly, as well as selectively to the protein. 

SUMMARY OF THE INVENTION 

[007] The present invention is directed to crystals comprising AIK and particularly 

crystals comprising AIK that have sufficient size and quality to obtain useful information about 
the structural properties of AIK and molecules or complexes that may associate with AIK. 
[008] In one embodiment, a composition is provided that comprises a protein in 

crystalline form wherein at least a portion of the protein has 55%, 65%, 75%, 85%, 90%, 95%, 
97%, 99% or greater identity with residues 125-391 or 126-388 of SEQ. ID No. 1. 
[009] In one variation, the protein has activity characteristic of AIK. For example, the 

protein may optionally be inhibited by inhibitors of wild type AIK. The protein crystal may 
also diffract X-rays for a determination of structure coordinates to a resolution of 4A, 3. 5 A, 
3.0A or less. 

[0010] In one variation, the protein crystal has a crystal lattice in a P6i22 space group. 

The protein crystal may also have a crystal lattice having unit cell dimensions, +/- 5%, of 
a=80.45A, b= 80.45A and c=172.18A. 

[0011] The present invention is also directed to crystallizing ADC. The present 

invention is also directed to the conditions useftil for crystallizing AIK. It should be 
recognized that a wide variety of crystallization methods can be used in combination with the 
crystallization conditions to form crystals comprising AIK including, but not limited to, vapor 
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diffusion, batch, dialysis, and other methods of contacting the protein solution for the purpose 
of crystalUzation. 

[0012] The present invention is also directed to crystallizing ADC. The present 

invention is also directed to the conditions usefiil for crystalUzing AIK. It should be 
recognized that a wide variety of crystallization methods can be used in combination with the 
crystallization conditions to form crystals comprising AIK including, but not limited to, vapor 
diffusion, batch, dialysis, and other methods of contacting the protein solution for the purpose 
of crystallization. 

[0013] In one embodiment, a method is provided for forming crystals of a protein 

comprising: forming a crystallization volume comprising: a protein wherein at lesist a portion 
of the protein has 55%, 65%, 75%, 85%, 90%, 95%, 97%, 99% or greater identity with 
residues 125-391 or 126-388 of SEQ. ED No. 1; and storing the crystallization volume under 
conditions suitable for crystal formation. . 

[0014] In one variation, the crystallization volume comprises the protein in a 

concentration between 1 mg/ml and 50 mg/ml, and 1-50% w/v of precipitant wherein the 
precipitant comprises one or more members of the group consisting of MPD, PEG 100-6000, 
PEG MME 550-5000, ammonium salt, glycerol and ethylene glycol, and wherein the 
crystallization volume has a pH between pH 6.5 and pH 10. 

[OOIS] The method may optionally further comprise forming a protein crystal that has a 

crystal lattice in a P6i22 space group. The method also optionally further comprises forming a 
protein crystal that has a crystal lattice having unit cell dimensions, +/- 5%, of a=80.45 A, b= 
80.45 A and c=172.18A. The invention also relates to protein crystals formed by these 
methods. 

[0016] The present invention is also directed to a composition comprising an isolated 

protein that comprises or consists of one or more of the protein sequence(s) of AIK taught 
herein for crystallizing AIK. The present invention is also directed to a composition 
comprising an isolated nucleic acid molecule that comprises or consists of the nucleotides for 
expressing the protein sequence of AIK taught herein for crystallizing AIK. 
[0017] The present invention is also directed to an expression vector that may be used 

to express the isolated proteins taugjit herein for crystallizing AIK. 
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[0018] The present invention is also directed to an expression vector that may be used 

to express the isolated proteins taught herein for crystallizing AIK. In one variation, the 
expression vector comprises a promoter that promotes expression of the isolated protein. 
[00191 The present invention is also directed to a cell line transformed or transfected by 

an isolated nucleic acid molecule or expression vector of the present invention. 
[0020] The present invention is also directed to stmcture coordinates for ADC as well as 

structure coordinates that are comparatively similar to these stmcture coordinates. It is noted 
that these comparatively similar stmcture coordinates may encompass protefais with similar 
sequences and/or stmctures, such as other kinase. For example, machine-readable data storage 
media is provided having data storage material encoded with machine-readable data that 
comprises stmcture coordinates that are comparatively similar to the stmcture coordinates of 
AIK. The present invention is also directed to a machine readable data storage medium having 
data storage material encoded with nfiachine readable data, which, when read by an appropriate 
machine, can display a three dimensional representation of all or a portion of a stmcture of 
AIK or a model that is comparatively similar to the stmcture of all or a portion of AIK. 
[0021] Various embodiments of machine readable data storage medium are provided 

that comprise data storage material encoded with machine readable data. The machine 
readable data comprises: stmcture coordinates that have a root mean square deviation equal to 
or less than the RMSD value specified in Columns 3, 4 or 5 of Table 1 when compared to the 
stmcture coordinates of Figure 3, the root mean square deviation being calculated such that the 
portion of amino acid residues specified in Column 2 of Table 1 of each set of stmcture 
coordinates are superimposed and the root mean square deviation is based only on those amino 
acid residues in the stmcture coordinates that are also present in the portion of the protein 
specified in specified in Column 1 of Table 1. The amino acids being overlayed and compared 
need not to be identical when the RMSD calculation is performed on alpha carbons and main 
chain atoms but the amino acids being overlayed and compared must have identical side chains 
when the RMSD calculation is performed on all non-hydrogen atoms. 

[0022] For example, in one embodiment where the comparison is based on the 4 

Angstrom set of amino acid residues (Column 1) and is based on superimposing alpha-carbon 
atoms (Column 2), the stmcture coordinates may have a root mean square deviation equal to or 
less than 0.44 when compared to the stmcture coordinates of Figure 3. 
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TABLE 1 



AA RESIDUES TO 
USE TO PERFORM 
RMSD 
COMPARISON 


PORTION OF EACH AA 
RESIDUE USED TO 
PERFORM RMSD 
COMPARISON 


RMSD VALUE LESS 
THAN OR EQUAL TO 


Table 2 
(4 Angstrom set) 


alpha-carbon atoms 


0.44 


0.30 


0.22 


main-chain atoms' 


0.43 


0.28 


0.21 


all non-hydrogen 


0.61 


0.41 


0.31 


Table 3 
(7 Angstrom set) 


alpha-carbon atoms' 


0.48 


0.31 


0.24 


main-chain atoms' 


0.47 


0.31 


0.23 


all non-hydrogen'^ 


0.65 


0.44 


0.33 


Table 4 
(10 Angstrom set) 


alpha-carbon atoms* 


0.82 


0.55 


0.41 


main-chain atoms' 


0.79 


0.53 


0.40 


all non-hydrogen^ 


0.93 


0.62 


0.47 


125-391 or 126-388 of 
SEQ. ID No. 1 


alpha-carbon atoms 


1.16 


0.78 


0.58 


main-chain atoms' 


1.15 


0.78 


0.58 


all non-hydrogen"^ 


1.20 


0.79 


0.60 



- the RMSD computed between the atoms of all amino acids that are common to both the 
target and the reference in the aligned and superposed structure. The amino acids need not to 
be identical. 

- the RMSD computed only between identical amino acids, which are common to both the 
target and the reference in the aligned and superposed stmcture. 

[0023] The present invention is also directed to a three-dimensional structure of all or a 

portion of AIK. This three-dimensional structure may be used to identify binding sites, to 
provide mutants having desirable binding properties, and ultimately, to design, characterize, or 
identify ligands capable of interacting with AIK. Ligands that interact with ADC may be any 
type of atom, compound, protein or chemical group that binds to or otherwise associates with 
the protein. Examples of types of ligands include natural substrates for AIK, inhibitors of AIK, 
and heavy atoms. The inhibitors of AIK may optionally be used as dmgs to treat therapeutic 
indications by modifying the in vivo activity of AIK. 

[0024] In various embodiments, methods are provided for displaying a three 

dimensional representation of a stmcture of a protein comprising: 

taking machine readable data comprising stmcture coordinates that have a root mean 
square deviation equal to or less than the RMSD value specified in Columns 3, 4 or 5 of Table 
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1 when compared to the structure coordinates of Figure 3, the root mean square deviation being 
calculated such that the portion of amino acid residues specified in Column 2 of Table 1 of 
each set of structure coordinates are superimposed and the root mean square deviation is based 
only on those amino acid residues in the structure coordinates that are also present in the 
portion of the protein specified in specified in Column 1 of Table 1; 

computing a three dimensional representation of a structure based on the structure 
coordinates; and 

displaying the three dimensional representation. 
[0025] The present invention is also directed to a method for solving a three- 

dimensional crystal structure of a target protein using the structure of ADC. 
[0026] In various embodiments, computational methods are provided comprising: 

taking machine readable data comprising structure coordinates that have a root mean square 
deviation equal to or less than the RMSD value specified in Colunms 3, 4 or 5 of Table 1 when 
compared to the structure coordinates of Figure 3, the root mean square deviation being 
calculated such that the portion of amino acid residues specified in Column 2 of Table 1 of 
each set of structure coordinates are superimposed and the root mean square deviation is based 
only on those amino acid residues in the structure coordinates that are also present in the 
portion of the protein specified in specified in Column 1 of Table 1; 

computing phases based on the structural coordinates; 

computing an electron density map based on the computed phases; and 

determining a three-dimensional crystal structure based on the computed electron 
density map. 

[0027] In various embodiments, computational methods are provided comprising: 

taking an X-ray diffraction pattern of a crystal of the target protein; and computing a three- 
dimensional electron density map from the X-ray diffraction pattem by molecular replacement, 
wherein structure coordinates used as a molecular replacement model comprise structure 
coordinates that have a root mean square deviation equal to or less than the RMSD value 
specified in Colxmins 3, 4 or 5 of Table 1 when compared to the structure coordinates of Figure 
3, the root mean square deviation being calculated such that the portion of amino acid residues 
specified in Column 2 of Table 1 of each set of structure coordinates are superimposed and the 
root mean square deviation is based only on those amino acid residues in the structure 
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coordinates that are also present in the portion of the protein specified in specified in Column 1 
of Table 1. 

[0028] These methods may optionally further comprise determining a three- 

dimensional crystal structure based upon the computed three-dimensional electron density 
map. 

[0029] The present invention is also directed to using a crystal structure of AIK, in 

particular the structure coordinates of AIK and the surface contour defined by them, in 
methods for screening, designing, or optimizing molecules or other chemical entities that 
interact with and preferably inhibit AIK. 

[0030] One skilled in the art will appreciate the numerous uses of the inventions 

described herein, particularly in the areas of drug design, screening and optimization of drug 
candidates, as well as in determining additional unknown crystal structures. For example, a 
further aspect of the present invention relates to using a tiiree-dimensional crystal structure of 
all or a portion of AIK and/or its structure coordinates to evaluate the ability of entities to 
associate with AIK. The entities may be any entity that may function as a ligand and thus may 
be any type of atom, compound, protein (such as antibodies) or chemical group that can bind to 
or otherwise associate with a protein. 

[0031] In various embodiments, methods are provided for evaluating a potential of an 

entity to associate with a protein comprising: 

creating a computer model of a protein structure using structure coordinates that 
comprise structure coordinates that have a root mean square deviation equal to or less than the 
RMSD value specified in Columns 3, 4 or 5 of Table 1 when compared to the structure 
coordinates of Figure 3, the root mean square deviation being calculated such that the portion 
of amino acid residues specified in Column 2 of Table 1 of each set of structure coordinates are 
superimposed and the root mean square deviation is based only on those amino acid residues in 
the structure coordinates that are also present in the portion of the protein specified in specified 
in Column 1 of Table 1; 

performing a fitting operation between the entity and the computer model; and 
analyzing results of the fitting operation to quantify an association between the entity 
and the model. 
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[0032] In other embodiments, methods are provided for identifying entities that can 

associate with a protein comprising: generating a three-dimensional structure of a protein using 
structure coordinates that comprise structure coordinates that have a root mean square 
deviation equal to or less than the RMSD value specified in Columns 3, 4 or 5 of Table 1 when 
compared to the structure coordinates of Figure 3, the root mean square deviation being 
calculated such that the portion of anfiino acid residues specified in Colunan 2 of Table 1 of 
each set of structure coordinates are superimposed and the root mean square deviation is based 
only on those amino acid residues in the structure coordinates that are also present in the 
portion of the protein specified in specified in Column 1 of Table 1; and 

employing the three-dimensional structure to design or select an entity that can 
associate with the protein; and contacting the entity with a protein wherein at least a portion of 
the protein has 55%, 65%, 75%, 85%, 90%, 95%, 97%, 99% or greater identity with residues 
125-391 or 126-388 of SEQ. IDNo. 1. 

[0033] In other embodiments, methods are provided for identifying entities that can 

associate with a protein comprismg: 

generating a three-dimensional structure of a protein using structiu'e coordinates that 
comprise stmcture coordinates that have a root mean square deviation equal to or less than the 
RMSD value specified in Columns 3, 4 or 5 of Table 1 when compared to the structure 
coordinates of Figure 3, the root mean square deviation being calculated such that the portion 
of amino acid residues specified in Column 2 of Table 1 of each set of stmcture coordinates are 
superimposed and the root mean square deviation is based only on those amino acid residues in 
the stmcture coordinates that are also present in the portion of the protein specified in specified 
in Column 1 of Table 1; and 

employing the three-dimensional structure to design or select an entity that can 
associate with the protein. 

[0034] In other embodiments, methods are provided for identifying entities that can 

associate with a protein comprising: 

computing a computer model for a protein binding pocket, at least a portion of the 
computer model having a surface contour that has a root mean square deviation equal to or less 
than a given RMSD value specified in Columns 3, 4 or 5 of Table 1 when the coordinates used 
to compute the surface contour are compared to the stmcture coordinates of Figure 3, wherein 
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(a) the root mean square deviation is calculated by the calculation method set forth herein, (b) 
the portion of amino acid residues associated with the given RMSD value in Table 1 (specified 
in Column 2 of Table 1) are superimposed according to the RMSD calculation, and (c) the root 
mean square deviation is calculated based only on those amino acid residues present in both the 
protein being modeled and the portion of the protein associated with the given RMSD in Table 
1 (specified in Column 1 of Table 1); 

employing the computer model to design or select an entity that can associate with the 
protein; and contacting the entity with a protein wherein at least a portion of the protein has 
55%, 65%, 75%, 85%, 90%, 95%, 97%, 99% or greater identity with residues 125-391 or 126- 
388 ofSEQ.IDNo. 1. 

[0035] In other embodiments, methods are provided for identifying entities that can 

associate with a protein comprising: 

computing a computer model for a protein binding pocket, at least a portion of the 
computer model having a surface contour that has a root mean square deviation equal to or less 
than a given RMSD value specified in Columns 3, 4 or 5 of Table 1 when the coordinates used 
to compute the surface contour are compared to the structure coordinates of Figure 3, wherein 
(a) the root mean square deviation is calculated by the calculation method set forth herein, (b) 
the portion of amino acid residues associated with the given RMSD value in Table 1 (specified 
in Column 2 of Table 1) are superimposed according to the RMSD calculation, and (c) the root 
mean square deviation is calculated based only on those amino acid residues present in both the 
protein being modeled and the portion of the protein associated with the given RMSD in Table 
1 (specified in Column 1 of Table 1); and 

employing the computer model to design or select an entity that can associate with the 

protein. 

[0036] In other embodiments, methods are provided for evaluating the ability of an 

entity to associate with a protein, the method comprising: 

constructing a computer model defined by structure coordinates that have a root mean 
square deviation equal to or less than the RMSD value specified in Columns 3, 4 or 5 of Table 
1 when compared to the structure coordinates of Figure 3, the root mean square deviation being 
calculated such that the portion of amino acid residues specified in Column 2 of Table 1 of 
each set of structure coordinates are superimposed and the root mean square deviation is based 
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only on those amino acid residues in the stmctxire coordinates that are also present in the 
portion of the protein specified in specified in Column 1 of Table 1; and 

selecting an entity to be evaluated by a method selected from the group consisting of (i) 
assembling molecular fragments into the entity, (ii) selecting an entity from a small molecule 
database, (iii) de novo ligand design of the entity, and (iv) modifying a known ligand for ADC, 
or a portion thereof; performing a fitting program operation between computer models of the 
entity to be evaluated and the binding pocket in order to provide an energy-minimized 
configuration of the entity in the binding pocket; and evaluating the results of the fitting 
operation to quantify the association between the entity and the binding pocket model in order 
to evaluate the ability of the entity to associate with the binding pocket, 
[00371 other embodiments, methods are provided for evaluating the ability of an 

entity to associate with a protein, the method comprising: 

computing a computer model for a protein binding pocket, at least a portion of the 
computer model having a surface contour that has a root mean square deviation equal to or less 
than a given RMSD value specified in Columns 3, 4 or 5 of Table 1 when the coordinates used 
to compute the surface contour are compared to the structure coordinates of Figure 3, wherein 
(a) the root mean square deviation is calculated by the calculation method set forth herein, (b) 
the portion of amino acid residues associated with the given RMSD value in Table 1 (specified 
in Column 2 of Table 1) are superimposed according to the RMSD calculation, and (c) the root 
mean square deviation is calculated based only on those amino acid residues present in both the 
protein being modeled and the portion of the protein associated with the given RMSD in Table 
1 (specified in Column 1 of Table 1); and 

selecting an entity to be evaluated by a method selected from the group consisting of (i) 
assembling molecular fragments into the entity, (ii) selecting an entity from a small molecule 
database, (iii) de novo ligand design of the entity, and (iv) modifying a known ligand for ADC, 
or a portion thereof; performing a fitting program operation between computer models of the 
entity to be evaluated and the binding pocket in order to provide an energy-minimized 
configuration of the entity in the binding pocket; and evaluating the results of the fitting 
operation to quantify the association between the entity and the binding pocket model in order 
to evaluate the ability of the entity to associate with the binding pocket. 
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[0038] In regard to each of these embodiments, the protein may optionally have activity 

characteristic of AIK. For example, the protein may optionally be inhibited by inhibitors of 
wild type AIK. 

[0039] In another embodiment, a method is provided for identifying an entity that 

associates with a protein comprising: taking structure coordinates from diffraction data 
obtained from a crystal of a protein wherein at least a portion of the protein has 55%, 65%, 
75%, 85%, 90%, 95%, 97%, 99% or greater identity with residues 125-391 or 126-388 of SEQ. 
ID No. 1 ; and performing rational drug design using a three dimensional structure that is based 
on the obtained structure coordinates. 

[0040] The protein crystals may optionally have a crystal lattice with a P6i22 space 

group and unit cell dimensions, +/- 5%, of a=80.45A, b= 80.45 A and c=172.18A. 
[0041] The method may optionally ftirther comprise selecting one or more entities 

based on the rational drug design and contacting the selected entities with the protein. The 
method may also optionally further comprise measuring an activity of the protein when 
contacted with the one or more entities. The method also may optionally further comprise 
comparing activity of the protein in a presence of and in the absence of the one or more 
entities; and selecting entities where activity of the protein changes depending whether a 
particular entity is present. The method also may optionally further comprise contacting cells 
expressing the protein with the one or more entities and detecting a change in a phenotype of 
the cells when a particular entity is present. 

BRIEF DESCRIPTION OF THE FIGURES 

[0042] Figure 1 illustrates SEQ. ID Nos. 1, 2, and 3 referred to in this application. 

[0043] Figure 2 illustrates a crystal of an AIK-ATPyS complex. 

[0044] Figure 3 lists a set of atomic structure coordinates for AIK as derived by X-ray 

crystallography from a crystal that comprises the protein. The following abbreviations are 
used in Figure 3: "X, Y, Z" crystallographically define the atomic position of the element 
measured; "B" is a thermal factor that measures movement of the atom around its atomic 
center; "Occ" is an occupancy factor that refers to the fraction of the molecules in which each 
atom occupies the position specified by the coordinates (a value of "1" indicates that each atom 
has the same conformation, i.e., the same position, in all molecules of the crystal). 
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[0045] Figure 4 illustrates a ribbon diagram overview of the structure of ADC, 

highlighting the secondary structural elements of the protein. 

[0046] Figure 5 illustrates ATPyS bound in the active site of ADC based on the 

determined crystal structure for the molecule in the asymmetric unit corresponding to the 
stmcture coordinates shown in Figure 3. 

[0047] Figure 6 illustrates a system that may be used to carry out instmctions for 

displaying a crystal structure of ADC encoded on a storage medium. 

DETAILED DESCRIPTION OF THE INVENTION 

[0048] The present invention relates to one of the family of tyrosine kinases known as 

ADC. More specifically, present invention relates to ADC in crystalline form, methods of 
forming crystals comprising ADC, methods of using crystals comprising AIK, a crystal 
structure of AIK, and methods of using the crystal stmcture. 

[0049] In describing protein structure and function herein, reference is made to anndno 

acids comprising the protein. The amino acids may also be referred to by their conventional 

abbreviations; A = Ala = Alanine; T = Thr = Threonine; V = Val = Valine; C = Cys = 
Cysteine; L = Leu = Leucine; Y = Tyr - Tyrosine; I = He =^ Isoleucine; N = Asn = Asparagine; 
P ~ Pro = Proline; Q = Gbi = Glutamine; F = Phe ~ Phenylalanine; D = Asp = Aspartic Acid; 
W = Trp = Tryptophan; E = Glu = Glutamic Acid; M = Met = Methionme; K = Lys = Lysine; 
G = Gly = Glycine; R = Arg = Arginine; S = Ser = Serine; and H = His = Histidine. 

1, AIK 

[0050] ADC is a cell-cycle-regulated serine-threonine kinase that regulates centrosome 

maturation, chromosome segregation and cytokinesis, ensuring genetic integrity of progenic 
cells. AIK amplification and overexpression has been detected in breast tumors, ovarian 
tumors and may play a role in oncogenic transformation. 

[0051] It should be understood that the methods and compositions provided herein 

relating to ADC are not intended to be limited to wild type ADC but instead are also directed to 
Augments and variants of ADC as described herein. 

[0052] In one embodiment, AIK comprises the wild-type form of fiiU length ADC, set 

forth herein as SEQ. ID No. 1. (GenBank Accession Number NP_003591; Sen, S., Zhou, H. 
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and White, R.A., "A putative serine/threonine kinase encoding gene BTAK on chromosome 
20ql3 is amplified and overexpressed in human breast cancer cell lines", Oncogene 14 (18), 
2195-2200, 1997). 

[0053] In another embodiment, ADC comprises residues 125-391 of SEQ. ID No. 1 

which comprises the kinase domain of wild-type AIK. 

[0054] In another embodiment, AIK comprises residues 126-388 of SEQ. ID No. 1 

which comprises the portion of the kinase domain of wild-type AIK that are represented in the 
set of structure coordinates shown in Figures 3. 

[0055] It should be recognized that the invention may be readily extended to various 

variants of wild-type AIK and variants of fi-agments thereof In another embodiment, AIK 
comprises a sequence that has at least 65% identity, preferably at least 70%, 80%, 90%, 95% 
or higher identity with any one of the above sequences (e.g., all of SEQ. ID No. 1 or residues 
125-391 or 126-388 of SEQ. ID No. 1). 

[0056] It is also noted that the above sequences of AIK is also intended to encompass 

isoforms, mutants and fusion proteins of these sequences. Preferred fusion proteins are 
exemplified by SEQ. ID No. 3 which includes a poly-histidine (Hise) region. 
[0057] With the crystal structure provided herein, where amino acid residues are 

positioned in the structure are now known. As a result, the impact of different substitutions 
can be more easily predicted and understood. 

[0058] For example, based on the crystal structure, applicants have determined that 

AIK amino acids in Table 2 are wifliin 4 Angstroms of and therefore close enough to interact 
with ATPyS. Applicants have also determined that the amino acids of Table 3 are within 7 
Angstroms of bound ATPyS and therefore are also close enough to interact with that substrate 
or analogs thereof Further it has been determined that the amino acids of Table 4 are within 
10 Angstroms of the bound ATPyS. One or either of these sets of amino acids is preferably 
conserved in a variant of AIK. Hence, AIK may optionally comprise a sequence that has at 
least 65% identity, preferably at least 70%, 80%, 90%, 95% or higher identity with any one of 
the above sequences (e.g., all of SEQ. ID No. 1 or residues 125-391 or 126-388 of SEQ. ID 
No. 1) where at least the residues shown in Tables 2, 3, and/or 4 are conserved with the 
exception of 0, 1, 2, 3, or 4 residues. It should be recognized that one might optionally also 
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vary some of the binding site residues in order to determine the effect such changes have on 
structure or activity. 

Table 2: ATP binding site residues within 4 Angstroms of ATPyS. 



LEU 139 


ALA 160 


THR 217 


GLY 140 


LYS 162 


GLU 260 


LYS 141 


LEU 194 


ASN 261 


GLY 142 


LEU 210 


LEU 263 


LYS 143 


GLU 211 


ASP 274 


VAL 147 


ALA 213 




^.TP binding site residues within 7 Angstroms of ATPyS. 


LEU 139 


VAL 163 


TYR 219 


GLY 140 


LEU 164 


ARG 220 


LYS 141 


GLU 181 


ASP 256 


GLY 142 


GLN 185 


LYS 258 


LYS 143 


LEU 194 


PRO 259 


PHE 144 


ARG 195 


GLU 260 


GLY 145 


LEU 210 


ASN 261 


ASN 146 


GLU 211 


LEU 262 


VAL 147 


TYR 212 


LEU 263 


TYR 148 


ALA 213 


LEU 264 


LEU 159 


PRO 214 


ALA 273 


ALA 160 


LEU 215 


ASP 274 


LEU 161 


GLY 216 


PHE 275 


LYS 162 


THR 217 


GLY 276 



Table 4: ATP binding site residues within 10 Angstroms of ATPyS. 



ARG 137 


LEU 169 


ARG 220 


PRO 138 


VAL 174 


GLU 221 


LEU 139 


GLN 177 


GLN 223 


GLY 140 


LEU 178 


HIS 254 


LYS 141 


GLU 181 


ASP 256 


GLY 142 


GLN 185 


LYS 258 


LYS 143 


ILE 193 


PRO 259 
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rnb 144 


T CT T 1 C\A 

LEU 1 94 


CjLU 260 


OLY 14j 


AKCj 19j 


ASN 261 


ASN 146 


LEU 196 


LEU 262 


\7 A T 1 /I T 

VAL 14/ 


lYK 197 


T T?T T '^^'5 

LEU 263 


'T'*\7T> ^ AO 

lYK 14o 


T TTT T '^AO 

LEU 2U6 


LEU 264 


T "CT T 1 AC\ 

LEU 149 


ILE 209 


GLY 265 


ATA 1 CA 

ALA 13U 


T CT T 1 A 

LEU 210 


SER 266 


TT ICO 

ILb 15o 


CjLU 211 


LYS 271 


LEU 159 


TYR 212 


ILE 272 


ATA 1 A 
AEA 1 OU 


ATA Oil 

ALA 21 J 


ATA ^11 

ALA 2 ID 


LEU 161 


PRO 214 


ASP 274 


LYS 162 


LEU 215 


PHE 275 


VAL 163 


GLY 216 


GLY 276 


LEU 164 


THR 217 


TRP 277 


PHE 165 


VAL 218 


SER 278 



[0059] With the benefit of the crystal structure and guidance provided by Tables 2, 3, 

and 4, a wide variety of AIK variants (e.g., insertions, deletions, substitutions, etc.) that fall 
within the above specified identity ranges may be designed and manufactured utilizing 
recombinant DNA techniques well known to those skilled in the art, particularly in view of the 
knowledge of the crystal structure provided herein. These modifications can be used in a 
number of combinations to produce the variants. The present invention is useful for 
crystallizing and then solving the structure of the range of variants of AIK. 
[0060] Variants of ADC may be insertional variants in which one or more amino acid 

residues are introduced into a predetermined site in the AIK sequence. For instance, 
insertional variants can be fusions of heterologous proteins or polypeptides to the amino or 
carboxyl terminus of the subunits. 

[0061] Variants of AIK also may be substitutional variants in which at least one residue 

has been removed and a different residue inserted in its place. Non-natural amino acids (i.e. 
amino acids not normally found in native proteins), as well as isosteric analogs (amino acid or 
otherwise) may optionally be employed in substitutional variants. Examples of suitable 
substitutions are well known in the art, such as the Glu— *Asp, Ser-^Cys, Cys— >Ser, and 
His— > Ala for example. 

[0062] Another class of variants is deletional variants, which are characterized by the 

removal of one or more amino acid residues from the AIK sequence. 
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[0063] Other variants may be produced by chemically modifying amino acids of the 

native protein (e.g., diethylpyrocarbonate treatment that modifies histidine residues). Preferred 
are chemical modifications that are specific for certain amino acid side chains. Specificity may 
also be achieved by blocking other side chains with antibodies directed to the side chains to be 
protected. Chemical modification includes such reactions as oxidation, reduction, amidation, 
deamidation, or substitution of bulky groups such as polysaccharides or polyethylene glycol. 
[0064] Exemplary modifications include the modification of lysinyl and amino terminal 

residues by reaction with succinic or other carboxylic acid anhydrides. Modification with 
these agents has the effect of reversing the charge of the lysinyl residues. Other suitable 
reagents for modifying amino-containing residues include imidoesters such as methyl 
picolinimidate; pyridoxal phosphate; pyridoxal chloroborohydride; trinitrobenzenesulfonic 
acid; 0-methylisourea, 2,4-pentanedione; and transaminaseN: talyzed reaction with glyoxylate, 
and N-hydroxysuccinamide esters of polyethylene glycol or other bulky substitutions. 
[0065] Arginyl residues may be modified by reaction with a number of reagents, 

including phenylglyoxal, 2,3-butanedione, 1 ,2-cyclohexanedione, and ninhydrin. Modification 
of arginine residues requires that the reaction be performed in alkaline conditions because of 
the high pK^, of the guanidine functional group. Furthermore, these reagents may react with 
the groups of lysine as well as the arginine epsilon-amino group. 

[0066] Tyrosyl residues may also be modified to introduce spectral labels into tyrosyl 

residues by reaction with aromatic diazonium compounds or tetranitromethane, forming 0- 
acetyl tyrosyl species and 3-nitro derivatives, respectively. Tyrosyl residues may also be 
iodinated using '"I or '^'I to prepare labeled proteins for use in radioimmunoassays. 
[0067] Carboxyl side groups (aspartyl or glutamyl) may be selectively modified by 

reaction with carbodiimides or they may be converted to asparaginyl and glutaminyl residues 
by reaction with ammonium ions. Conversely, asparaginyl and glutaminyl residues may be 
deamidated to the corresponding aspartyl or glutamyl residues, respectively, imder mildly 
acidic conditions. Either form of these residues falls within the scope of this invention. 
[0068] Other modifications that may be formed include the hydroxylation of proline 

and lysine, phosphorylation of hydroxyl groups of seryl or threonyl groups of lysine, arginine 
and histidine side chains (T. E. Creighton, Proteins: Structure and Molecular Properties, W.H. 
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Freeman & Co., San Francisco, pp. 79-86, 1983), acetylation of the N-terminal amine and 
amidation of any C-terminal carboxyl group. 

[0069] As can be seen, modifications of the nucleic sequence encoding AIK may be 

accomplished by a variety of well-known techniques, such as site-directed mutagenesis (see, 
Gillman and Smith, Gene 8:81-97 (1979) and Roberts, S. et aL, Nature 328:731-734 (1987)). 
When modifications are made, these modifications may optionally be evaluated for there affect 
on a variety of different properties including, for example, solubility, crystallizability and a 
modification to the protein's structure and activity. 

[0070] In one variation, the variant and/or firagment of wild-type AIK is functional in 

the sense that the resulting protein is capable of associating with at least one same chemical 
entity that is also capable of selectively associating with a protein comprising the kinase 
domain of wild-type AIK (e.g., residues 125-391 of SEQ. ID No. 1) since this conunon 
associative ability evidences that at least a portion of the native structure has been conserved. 
[00711 It is noted the activity of the native protein need not necessarily be conserved. 

Rather, amino acid substitutions, additions or deletions that interfere with native activity but 
which do not significantly alter the three-dimensional structure of the domain are specifically 
contemplated by the invention. Crystals comprising such variants of AIK, and the atomic 
structure coordinates obtained there firom, can be used to identify compounds that bind to the 
native domain. These compounds may affect the activity or the native domain. 
[0072] Amino acid substitutions, deletions and additions that do not significantly 

interfere with the three-dimensional structure of AIK will depend, in part, on the region where 
the substitution, addition or deletion occurs in the crystal structure. These modifications to the 
protein can now be made far more intelligently with the crystal structure information provided 
herein. In highly variable regions of the molecule, non-conservative substitutions as well as 
conservative substitutions may be tolerated without significantly disrupting the three- 
dimensional structure of the molecule. In highly conserved regions, or regions containing 
significant secondary structure, conservative amino acid substitutions are preferred. 
[0073] Conservative amino acid substitutions are well known in the art, and include 

substitutions made on the basis of similarity in polarity, charge, solubility, hydrophobicity, 
hydrophilicity and/or the amphipathic nature of the amino acid residues involved. For example, 
negatively charged amino acids include aspartic acid and glutamic acid; positively charged 
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amino acids include lysine and arginine; amino acids with uncharged polar head groups having 
similar hydrophilicity values include the following: leucine, isoleucine, valine; glycine, 
alanine; asparagine, glutamine; serine, threonine; phenylalanine, tyrosine. Other conservative 
amino acid substitutions are well known in the art. 

[0074] It should be understood that the protein may be produced in whole or in part by 

chemical synthesis. As a result, the selection of amino acids available for substitution or 
addition is not limited to the genetically encoded amino acids, hideed, mutants may optionally 
contain non-genetically encoded amino acids. Conservative amino acid substitutions for many 
of the commonly known non-genetically encoded amino acids are well known in the art. 
Conservative substitutions for other amino acids can be determined based on their physical 
properties as compared to the properties of the genetically encoded amino acids. 
[0075] Li some instances, it may be particularly advantageous or convenient to 

substitute, delete and/or add amino acid residues in order to provide convenient cloning sites in 
cDNA encoding the polypeptide, to aid in purification of the polypeptide, etc. Such 
substitutions, deletions and/or additions which do not substantially alter the three dimensional 
structure of AIK will be apparent to those having skills in the art, particularly in view of the 
three dimensional stracture of AIK provided herein. 

2. Cloning^ Expression and Purification of AIK 
[0076] The gene encoding AIK can be isolated from RNA, cDNA or cDNA libraries. 

In tins case, the portion of the gene encoding residues 125-391 was isolated and is shown as 
SEQ. LD. No. 2. 

[0077] Construction of expression vectors and recombinant proteins from the DNA 

sequence encoding AIK may be performed by various methods well known in the art. For 
example, these techniques may be performed according to Sambrook et al.. Molecular Cloning- 
A Laboratory Manual, Cold Spring Harbor, N.Y. (1989), and Kriegler, M., Gene Transfer and 
Expression, A Laboratory Maiual, Stockton Press, New York (1990). 

[0078] A variety of expression systems and hosts may be used for the expression of 

AIK. Example 1 provides one such expression system. 

[0079] Once expressed, piuification steps are employed to produce AIK in a relatively 

homogeneous state. In general, a higher purity solution of a protein increases the likelihood 
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that the protein will crystallize. Typical purification methods include the use of centrifugation, 
partial fractionation, using salt or organic compounds, dialysis, conventional column 
chromatography, (such as ion exchange, molecular sizing chromatography, etc.), high 
performance liquid chromatography (HPLC), and gel electrophoresis methods (see, e.g., 
Deutcher, "Guide to Protein Purification" in Methods in Enzymology (1990), Academic Press, 
Berkeley, California). 

[0080] AIK may optionally be affinity labeled during cloning, preferably with a poly- 

histidine (Hise) region, in order to facilitate purification. With the use of an affmity label, it is 
possible to perform a one-step purification process on a purification column that has a unique 
affinity for the label. The affinity label may be optionally removed after purification. These 
and other purification methods are known and will be apparent to one of skill in the art. 

3. Crystallization & Crystals Comprising AIK 
[0081] One aspect of the present invention relates to methods for forming crystals 

comprising AIK as well as crystals comprising AIK. 

[0082] In one embodiment, a method for forming crystals comprising AIK is provided 

comprising forming a crystallization volimie comprising AIK, precipitant, optionally a buffer, 
optionally a monovalent or divalent salt and optionally an organic solvent; and storing the 
crystallization volume under conditions suitable for crystal formation. 

[0083] In another embodiment, a method for forming crystals comprising AIK is 

provided comprismg forming a crystallization volume comprising AIK, one or more 
precipitants selected firom the group comprising ethylene glycol, polyethylene glycol, glycerol, 
MPD or ammonium sah (or mixtures thereof), optionally a buffer, optionally a monovalent or 
divalent salt and optionally an organic solvent; and storing the crystallization volume under 
conditions suitable for crystal formation. 

[0084] In yet another embodiment, a method for forming crystals comprising AIK is 

provided comprising forming a crystallization volume comprising AIK in solution comprising 
the components shown in Table 5; and storing the crystallization volume under conditions 
suitable for crystal formation. 
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Table 5 

Precipitant 

1-50% w/v comprising one or more of MFD, PEG 100-6000, 
PEG MME 550-5000, anunonium salt, glycerol and ethylene 
glycol 

CH 

pH 6.5-10. Buffers that may be used include, but are not limited 
to MES, TRIS, CHES, cacodylate, bicine, imidazole, acetate, 
hepes, citrate, and combinations thereof 

Protein Concentration 

1 mg/ml - 50 mg/ml 

Temperature 

4^C - 25^C 



[0085] In yet another embodiment, a method for forming crystals comprising ADC is 

provided comprising forming a crystallization volume comprising AIK; introducing crystals 
comprising AIK as nucleation sites, and storing the crystallization volxime under conditions 
suitable for crystal formation. 

[0086] Crystallization experiments may optionally be performed in volumes commonly 

used in the art, for example typically 15, 10, 5, 2 microliters or less. It is noted that the 
crystallization voliraie optionally has a volxmie of less than 1 microliter, optionally 500, 250, 
150, 100, 50 or less nanoliters. 

[0087] It is also noted that crystallization may be performed by any crystallization 

method including, but not limited to batch, dialysis and vapor diffusion (e.g., sitting drop and 
hanging drop) methods. Micro and/or macro seeding of crystals may also be performed to 
facilitate crystallization. 

[0088] It should be imderstood that forming crystals comprising AIK and crystals 

comprising AIK according to the invention are not intended to be limited to the wild-type, full 
length AIK shown in SEQ. ID No. 1 and to fragments comprising residues 125-391 or 126-388 
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of SEQ. E) No. 1 . Rather, it should be recognized that the invention may be extended to 

various other fragments and variants of wild-type ADC as described above. 

[0089] It should also be understood that forming crystals comprising ADC and crystals 

comprising AIK according to the invention may be such that AIK is complexed with one or 

more ligands. The ligand used to form the complex may be any ligand capable of binding to 

ADC. In one variation, the ligand is a natural substrate. In another variation, the ligand is an 

inhibitor. 

[0090] In one particular variation, the ligand binds to the ATP binding site of the 

protein. Examples of such ligands include, but are not limited to, small molecule inhibitors of 
AIK as well as ATP, non-hydrolyzable ATP analogs and suicide substrates. Non-hydrolyzable 
ATP analogs useful in the crystallizable compositions of this invention include AMP-PCH2P, 
AMP-PNP, AMP-PSP and AMP where the oxygen linking the second and third phosphates of 
the ATP analogs is replaced by CH2, S (ATPyS) and NH, respectively. An example of a 
suicidal substrate is 5-(p-fluorosulfonyl benzoyl) adenosine (FSBA). Preferably, the 
crystallizable compositions of this invention comprise ATPyS as the substrate. 
[0091] Optionally, the AIK complex may fiirther comprise divalent cations, especially 

magnesium or manganese cations, which may be introduced in any suitable manner. For 
example, the cations may be introduced by incubating the desired ligand with a suitable metal 
salt such as MgCli prior to incubation with the AIK protein. 

[0092] In one particular embodiment, ADC crystals have a crystal lattice in the P6i22 

space group. ADC crystals may also optionally have xmit cell dimensions, +/- 5%, of 
a=80.45A, b= 80.45 A and c=172.18A. 

[0093] AIK crystals also preferably are capable of diffracting X-rays for determination 

of atomic coordinates to a resolution of 4 A, 3 A, 2.5 A, 2 A or greater. 

[0094] Crystals comprising AIK may be formed by a variety of different methods 

known in the art. For example, crystallizations may be performed by batch, dialysis, and 
vapor diffusion (sitting drop and hanging drop) methods. A detailed description of basic 
protein crystallization setups may be found in McRee, D. and David. P., Practical Protein 
Crvstallographv. 2'^'* Ed. (1999), Academic Press Inc. Further descriptions regarding 
performing crystallization experiments are provided in Stevens, et al. (2000) Cwrr. Opin. 
Struct Biol: 10(5):558-63, and U.S. Patent Nos. 6,296,673, 5,419,278, and 5,096, 676. 
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[0095] In one variation, crystals comprising AIK are formed by mixing substantially 

pure AIK with an aqueous buffer containing a precipitant at a concentration just below a 
concentration necessary to precipitate the protein. One suitable, precipitant for crystallizing 
AIK is polyethylene glycol (PEG), which combines some of the characteristics of the salts and 
other organic precipitants (see, for example. Ward et al., J, Mol Biol 98:161, 1975, and 
McPherson, J. Biol Chem. 251:6300, 1976). 

[0096] During a crystallization experiment, water is removed by diffusion or 

evaporation to increase the concentration of the precipitant, thus creating precipitating 
conditions for the protein. In one particular variation, crystals are grown by vapor diffusion in 
hanging drops or sitting drops. According to these methods, a protein/precipitant solution is 
formed and then allowed to equilibrate in a closed container with a larger aqueous reservoir 
having a precipitant concentration for producing crystals. The protein/precipitant solution 
continues to equilibrate until crystals grow. 

[0097] By performing submicroUter volume sized crystallization experiments, as 

detailed in U.S. Patent No. 6,296,673, effective crystallization conditions for forming crystals 
of an AIK-ATPyS complex were obtained. In order to accomplish this, systematic broad 
screen crystallization trials were performed on an AIK-ATPyS complex using the sitting drop 
technique. In each experiment, a lOOnL mixture of ADC-ATPyS complex and precipitant was 
placed on a platform positioned over a well containing lOOjiL of the precipitating solution. 
Precipitate and crystal formation was detected in the sitting drops. Fine screening was then 
carried out for those crystallization conditions that appeared to produce precipitate and/or 
crystal in the drops. 

[0098] Based on the crystallization experiments that were performed, a thorough 

understanding of how different crystallization conditions affect AIK crystallization was 
obtained. Based on this understanding, a series of crystallization conditions were identified 
that may be used to form crystals comprising AIK. These conditions are summarized in Table 
5. A particular example of crystallization conditions that may be used to form crystals 
diffraction quality crystals of the ADC-ATPyS complex is detailed in Example 2. Figure 2 
illustrates crystals of the ADC-ATPyS complex formed using the crystallization conditions 
provided in Table 5. 
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[0099] One skilled in the art will recognize that the crystallization conditions provided 

in Table 5 and Example 2 can be varied and still yield protein crystals comprising AIK. For 
example, it is noted that variations on the crystallization conditions described herein can be 
readily determined by taking the conditions provided in Table 5 and performing fine screens 
around those conditions by varying the type and concentration of the components in order to 
determine additional suitable conditions for crystallizing AIK, variants of AIK, and ligand 
complexes thereof. 

[00100] Crystals comprising AIK have a wide range of uses. For example, now that 
crystals comprising AIK have been produced, it is noted that crystallizations may be performed 
using such crystals as a nucleation site within a concentrated protein solution. According to 
this variation, a concentrated protein solution is prepared and a crystalline material 
(microcrystals) is used to 'seed' the protein solution to assist nucleation for crystal growth. If 
ttie concentrations of the protein and any precipitants are optimal for crystal growth, the seed 
crystal will provide a nucleation site around which a larger crystal forms. Given the ability to 
form crystals comprising AIK according to the present invention, the crystals so formed can be 
used by this crystallization technique to initiate crystal growth of other AIK comprising 
crystals, including AIK complexed to other ligands. 

[00101] As will be described herein in greater detail, crystals may also be used to 
perform X-ray or neutron diffraction analysis in order to determine the three-dimensional 
structure of AIK and, in particular, to assist in the identification of its active site. Knowledge 
of the binding site region allows rational design and construction of ligands including 
inhibitors. Crystallization and structural determination of AIK mutants having altered 
bioactivity allows the evaluation of whether such changes are caused by general structure 
deformation or by side chain alterations at the substitution site. 

4. X"Rav Data Collection and Structure Determination 
[00102] Crystals comprising AIK may be obtained as described above in Section 3. As 
described herein, these crystals may then be used to perform x-ray data collection and for 
structure determination. 

[00103] In one embodiment, described in Example 2, crystals of an AIK-ATPyS 
complex were obtained where ADC has the sequence of residues shown in SEQ. ID No. 3 (the 
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His6 affinity tag was removed after purification). These particular crystals were used to 
determine the three dimensional structure of AIK. However, it is noted that other crystals 
comprising AIK including different AIK variants, fi-agments, and complexes thereof may also 
be used. 

[00104] Diffraction data was collected from cryocooled crystals (lOOK) of the AIK- 
ATPyS complex at the Advanced Light Source beam line 5.0.3 using an ADSC CCD detector. 
The diffraction pattem of the AIK-ATPyS complex displayed symmetry consistent with space 
group P6i22, with unit cell dimensions of a=80.45A, b= 80.45 A and c=172.18A. Data were 
collected and integrated to 1.9A with MOSFLM and scaled with SCALA (CCP4 Study 
Weekend (eds. Sawyer, L., Isaacs, N. & Bailey, S.) 56-62 (SERC Daresbury Laboratory, 
England) (1993). 

[00105] All crystallographic calculations were performed using the CCP4 program 
package (Collaborative Computational Project, N. The CCP4 Suite: Programs for Protein 
Crystallography. Acta Cryst. D50, 760-763 (1994)). The initial phases for AIK were obtained 
by the molecular replacement method using the program AMORE. The coordinates of yeast 
cAPK (PDB code IFOT) were used as a search model (38% identity) for the solution of the 
AIK-ATPyS structure. The highest solution from the translational function was subjected to a 
rigid body rotation followed by refinement against the maximimi likelihood method as 
implemented in REFMAC (CCP4). Rigid body refinement and torsional dynamics refinement 
was followed by multiple roimds of manual building with Xfit (McRee, D.E. XtalView/Xfit-A 
versatile program for manipulating atomic coordinates and electron density J. Struct Biol 125, 
156-65 (1999)) and/or ARP^WARP map improvement (Perrakis, A., Morris, RJ. & Lamzin, 
V.S. Automated protein model building combined with iterative structure refinement). All 
stages of model refinement were carried with bulk solvent correction and anisotropic scaling. 
The data collection and data refmement statistics are given in Table 6. 

TABLE 6 



Crystal data 




Ligand 


ATPyS 


Space group 


P6i22 


Unit cell dimensions 


a=b= 80.45A and 
c=172.18A 
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Data cnllf^ctinn 


AIK-ATPvS 




Bl 5 0 3 

Xi#i m/»\j0ij 


Wavelenoth fAl 


1 0 


Re^oliitinn fAl 


44-1 9 


OHsprvatinn^ Ainimie^ 


28 879 


R ediifidanc V 

XX. wU UlXKACUlW J 


7 4 


dnmnletenp^^ overall rmifpr qIipIH 


98% ^93%"^ 


^lf^(T\ ovprjill Tmifpr qVipII^ 

MJ\J\L§ \JVwlCLli y^^JULVi Oiiwli 1 




R^ ^ nvprall /'mifpr qVipII^ 


0 086 CO 5781 






JLXd.lMl Vlllwll %. 




Reflections used 


25,081 


R-factor 


22.5% 


Rfree 


26.8% 


r.m.s bonds 


0.019 


r.m.s angles 


1.99 


' Rsymm = ShkiSi | I(hkl)i-<I(hkl)> | / Shki^i <I(hkl)i> ovcr I 
observations of a reflection hkl 



[00106] Each unit cell comprised one AIK-ATPYS-(Mg^'^2 complex. Structure 
coordinates were determined for this molecule and the refined set of coordinates are presented 
in Figure 3. 

[00107] It is noted that the sequence of the stracture coordinates presented in Figure 3 
differ in some regards from the sequence shown in SEQ. ID No. 1 . 

[00108] For some residues, the electron density obtained was insufficient to identify the 
side chain. As a result, the side chains of these residues were truncated such that a different 
amino acid is reported. Table 7 summarizes the differences between SEQ. ID No. 1 and the 
truncated residues appearing in Figure 3. 

Table 7: Truncated Residues in The Structure Coordinates of Figure 3. 



126K-126A 


171K-170A 


339K-339A 


127R-127A 


175E-175A 


375R-375A 


170E-170A 


183E-183A 
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[00109] It is also noted that structure coordinates are not reported for some residues 
because the electron density obtained was insufficient to identify the position of these residues. 
For Figure 3, structure coordinates for residues 124-125, 286-287 and 389-391 are not 
reported. 

[00110] Those of skill in the art understand that a set of structure coordinates (such as 
those in Figure 3) for a protein or a protein-complex or a portion thereof, is a relative set of 
points that define a shape in three dimensions. Thus, it is possible that an entirely different set 
of structure coordinates could define a similar or identical shape. Moreover, slight variations 
in the individual coordinates may have little effect on overall shape, hi terms of binding 
pockets, these variations would not be expected to significantly alter the nature of ligands that 
could associate with those pockets. The term "binding pocket" as used herein refers to a region 
of the protein that, as a result of its shape, favorably associates with a ligand 
[00111] These variations in coordinates may be generated because of mathematical 
manipulations of the AIK stmcture coordinates. For example, the sets of structure coordinates 
shown in Figure 3 could be manipulated by crystallographic permutations of the structure 
coordinates, fractionalization of the structure coordinates, application of a rotation matrix, 
integer additions or subtractions to sets of the structure coordinates, inversion of the structure 
coordinates or any combination of the above. 

[00112] Alternatively, modifications in the crystal structure due to mutations, additions, 
substitutions, and/or deletions of amino acids or other changes in any of the components that 
make up the crystal could also account for variations in structure coordinates. If such 
variations are within an acceptable standard error as compared to the original coordinates, the 
resulting three-dimensional shape should be considered to be the same. Thus, for example, a 
ligand that bound to the active site binding pocket of AIK would also be expected to bind to 
another binding pocket whose structure coordinates defined a shape that fell within the 
acceptable error. 

[00113] Various computational methods may be used to determine whether a particular 
protein or a portion thereof (referred to here as the **target protein"), typically the binding 
pocket, has a high degree of three-dimensional spatial similarity to another protein (referred to 
here as the "reference protein") against which the target protein is being compared. 
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[00114] The process of comparing a target protein structure to a reference protein 
structure may generally be divided into three steps: 1) defining the equivalent residues and/or 
atoms for the target and reference proteins, 2) performing a fitting operation between the 
proteins; and 3) analyzing the results. These steps are described in more detail below. All 
structure comparisons reported herein and the structure comparisons claimed are intended to be 
based on the particular comparison procedure described below. 

[00115] Equivalent residues or atoms can be determined based upon an alignment of 
primary sequences of the proteins, an alignment of their structural domains or as a combination 
of both. Sequence alignments generally implement the dynamic programming algorithm of 
Needleman and Wunsch [J. Mol Biol. 48: 442-453, 1970]. For the purpose of this invention 
the sequence alignment was performed using the publicly available software program MOE 
(Chemical Computing Group Inc.) package version 2002.3, as described in the accompanying 
User's Manual. When using the MOE program, alignment was performed in the sequence 
editor window using the ALIGN option utilizing the following program parameters: Initial 
pairwise Build-up: ON, Substitution Matrix: Blosum62, Round Robin: ON, Gap Start: 7, Gap 
Extend: 1, Iterative Refinement: ON, Build-up: TREE-BASED, Secondary Structure: NONE, 
Structural Alignment: ENABLED, Gap Start: 1, Gap Extend: 0.1 

[00116] Once aligned, a rigid body fitting operation is performed where the structure for 
the target protein is translated and rotated to obtain an optimum fit relative to the structure of 
the reference protein. The fitting operation uses an algorithm that computes the optimiun 
translation and rotation to be applied to the moving structure, such that the root mean square 
deviation of the fit over the specified pairs of equivalent atoms is an absolute minimum. For 
the purpose of fitting operations made herein, the publicly available software program MOE 
(Chemical Computing Group Inc.) v. 2002.3 was used. 

[00117] The results fi-om this process are typically reported as an RMSD value between 
two sets of atoms. The term "root mean square deviation" means the square root of the 
arithmetic mean of the squares of deviations. It is a way to express the deviation or variation 
jfrom a trend or object. As used herein, an RMSD value refers to a calculated value based on 
variations in the atomic coordinates of a reference protein fi"om the atomic coordinates of a 
reference protein or portions of thereof The stmcture coordinates for AIK, provided in Figure 
3, are used as the reference protein in these calculations. 
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[00118] The same set of atoms was used for initial fitting of the structures and for 
computing root mean square deviation values. For example, if a root mean square deviation 
(RMSD) between Ca atoms of two proteins is needed, the proteins in question should be 
superposed only on the Ca atoms and not on any other set of atoms. Similarly, if an RMSD 
calculation for all atoms is required, the superposition of two structures should be performed 
on all atoms. 

[00119] Based on a review of protein structures deposited in the Protein Databank 
(PDB), 106K was identified as having the smallest RMSD values relative to the structure 
coordinates provided herein. Table 8 below provides a series of RMSD values that were 
calculated by the above described process using the structure coordinates in Figure 3 as the 
reference protein and the stmcture coordinates from PDB code: 106K (Activated Pkb Kinase 
with GSK3 peptide and ANP-PNP) as the target protein. 



TABLES 



AA RESIDUES USED 
TO PERFORM RMSD 
COMPARISON WITH 
PDB:106K 


PORTION OF EACH AA 
RESIDUE USED TO PERFORM 
RMSD COMPARISON WITH 
PDB:106K 


RMSD 

[A] 


Table 2 
(4 Angstrom set) 


alpha-carbon atoms' 


0.89 


main-chain atoms' 


0.85 


all non-hydrogen" 


1.22 


Table 3 
(7 Angstrom set) 


alpha-carbon atoms' 


1.00 


main-chain atoms' 


0.93 


all non-hydrogen 


1.32 


Table 4 
(10 Angstrom set) 


alpha-carbon atoms' 


1.64 


main-chain atoms' 


1.60 


all non-hydrogen^ 


1.86 


126-388 of 
SEQ.IDNo. 1 


alpha-carbon atoms' 


2.33 


main-chain atoms' 


2.31 


all non-hydrogen"^ 


2.38 



^- the RMSD computed between the atoms of all amino acids that are common to both 
the target and the reference in the aligned and superposed structure. The amino acids need not 
to be identical. 

^- the RMSD computed only between identical amino acids, which are common to both 
the target and the reference in the aligned and superposed structure. 
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[00120] It is noted that mutants and variants of AIK as well as other kinase are likely to 
have similar structures despite having different sequences. For example, the binding pockets 
of these related proteins are likely to have similar contours. Accordingly, it should be 
recognized that the structure coordinates and binding pocket models provided herein have 
utility for these other related proteins. 

[00121] Accordingly, in one embodiment, the invention relates to data, computer 
readable media comprising data, and uses of the data where the data comprises all or a portion 
of the structure coordinates shown in Figure 3 or structure coordinates having a root mean 
square deviation (RMSD) equal to or less than the RMSD value specified in Columns 3, 4 or 5 
of Table 1 when compared to the structure coordinates of Figure 3, the root mean square 
deviation being calculated such that the portion of amino acid residues specified in Column 2 
of Table 1 of each set of structure coordinates are superimposed and the root mean square 
deviation is based only on those anMio acid residues in the structure coordinates that are also 
present in the portion of the protein specified in specified in Column 1 of Table 1. 
[00122] As noted, there are many different ways to express the surface contours of the 
AIK structure other than by using the structure coordinates provided in Figure 3. Accordingly, 
it is noted that the present invention is also directed to any data, computer readable media 
comprising data, and uses of the data where the data defines a computer model for a protein 
binding pocket, at least a portion of the computer model having a surface contour that has a 
root mean square deviation equal to or less than a given RMSD value specified in Columns 3, 
4 or 5 of Table 1 when the coordinates used to compute the surface contour are compared to 
the structure coordinates of Figure 3, wherein (a) the root mean square deviation is calculated 
by the calculation method set forth herein, (b) the portion of amino acid residues associated 
with the given RMSD value in Table 1 (specified in Column 2 of Table 1) are superimposed 
according to the RMSD calculation, and (c) the root mean square deviation is calculated based 
only on those amino acid residues present in both the protein being modeled and the portion of 
the protein associated with the given RMSD in Table 1 (specified in Column 1 of Table 1). 

5, AIK-ATPvS Structure 
[00123] The present invention is also directed to a three-dimensional crystal structure of 
AIK. This crystal structure may be used to identify binding sites, to provide mutants having 
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desirable binding properties, and ultimately, to design, characterize, or identify ligands that 
interact with AIK. 

[00124] The three-dimensional crystal structure of AIK may be generated, as is known 
in the art, from the stmcture coordinates shown in Figure 3 and similar such coordinates. 
[00125] The refined crystal structure of AIK-ATPyS determined according to the present 
invention contains amino acids residues 126-388 as numbered according to SEQ. JD No. 1 
(based on the coordinates of Figure 3), one bound ATPyS molecule, and two Mg^^ ions. A 
total of 73 water molecules were included. 

[00126] Figure 4 illustrates a ribbon diagram overview of the structure of AIK, 
highlighting the secondary structural elements of the protein. As can be seen, the structure 
exhibits bilobal architecture typical of protein kinase catalytic domains. The smaller N- 
terminal lobe contains a five-stranded anti-parallel p-sheet (jJl- p5) and an a-helix (aC), The 
larger C-terminal lobe consists of eight a-helices (aD- oJ). The C-terminal lobe contains 
functionally important loop regions: the glycine-rich nucleotide binding loop, the catalytic loop 
and the activation loop (A-loop) involved in polypeptide substrate binding. The nucleotide 
ligands bind in a cleft between the two lobes. 

[00127] Kinases show considerable variability in the relative orientation of the N and C 
lobes, in the position and orientation of the aC, and in the conformation of the activation loop. 
This relative orientation of the N- and C-terminal lobes is important in kinase function. A 
catalytically active conformation is generally a closed structure in which the two lobes clamp 
together bringing conserved residues into catalytically optimal positions. In particular, in the 
active conformation, the aC helix becomes parallel with the cleft between the lobes and makes 
tertiary contacts with the C-lobe. In the inactive conforaiation observed in several 
unphosphorylated kinase structures the two lobes are spaced apart at a much higher angle and 
the aC helix is rotated away from the C-lobe. 

[00128] The conformation of AIK closely resembles the closed, active conformations of 
cAPK, and cyclin-dependent kinase 2 (cdk2). The aC helix adopts the active conformation. 
The two lobes clamp together, bringing residues of the catalytic cleft into alignment. 
[00129] Figure 5 illustrates ATP7S bound in the active site of AIK based on the 
determined crystal structure for the molecule in the asymmetric unit corresponding to the 
structure coordinates shown in Figure 3. 
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[00130] The aC helix is positioned so that the conserved residue El 81 makes a salt 
bridge with K162, in close proximity to the P-phosphate of the boimd ADP ligand (Figure 4). 
The activation loop (A-loop; residues 274-297) is well ordered with the exception of residues 
286 and 287. The loop is stabilized by numerous interactions both within the loop and with 
other regions of the kinase. Residues 278-279 and 282-283 form two short stretches of anti- 
parallel beta sheet with residues 252-253 and 305-306 respectively, and the side chain of R285 
forms a hydrogen bond with the backbone of 1301. Those contacts anchor the N-terminal 
portion of the activation loop to the core of the C-terminal lobe. The conformation of the A- 
loop allows access to the substrate-binding site. The observation of an active AIK 
conformation was unanticipated since most kinases require phosphorylation of their activation 
loop to adopt the active conformation. Although imphosphorylated protein was used in 
crystallization, there is a free phosphate ion bound 1.3 A from the Oy of T288. The phosphate 
is bound in a pocket created by side chains of R180, HI 76 and R255. These side chains are in 
positions analogous to those in phosphorylated, activated kinases, suggesting that the 
phosphate-bound AIK structure mimics a catalytically active conformation. 
[00131] The similarity of the phosphate binding pocket and the overall structure of AIK 
and other activated kinases suggests that the phosphate-bovmd AIK structure mimics the active, 
phosphorylated state of the enzyme. It is likely that the activation of AIK follows a similar 
mechanism as the activation of cdk2 and cAPK. In the unphosphorylated state, the A-loop 
blocks the access of the substrate to the catalytic cleft. Upon phosphorylation, the phosphate 
group acts as an organizing center that induces conformational change in the A-loop by 
moving it into a position that allows productive substrate binding and catalysis. 

6. AIK Binding Pocket and Ligand Interaction 
[00132] The term "binding site" or "binding pocket", as the terms are used herein, refers 
to a region of a protein that, as a result of its shape, favorably associates with a ligand or 
substrate. The term "AIK-like binding pocket" refers to a portion of a molecule or molecular 
complex whose shape is sufficiently similar to the AIK binding pockets as to bind common 
ligands. This commonality of shape may be quantitatively defined by a root mean square 
deviation (rmsd) from the structure coordinates of the backbone atoms of the anfiino acids that 
make up the binding pockets in AIK (as set forth in Figure 3). 
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[00133] The "active site binding pockets" or "active site" of ADC refers to the area on the 
surface of AIK where the substrate binds. 

[00134] Figure 5 illustrates ATPyS bound in the active site of AIK based on the 
determined crystal structure of the present invention. As can be seen, ATPyS binds in a cleft 
between the two lobes and is coordinated by residues on the N-terminal lobe. Residues in the 
C-terminal lobe coordinate substrate binding and catalysis. Although the ATPyS was used 
during crystallization, the y-phosphate is not visible in the structure suggesting hydrolysis 
driven by a high pH of the crystallization conditions. The ATPyS interacts with the protein 
through both direct and water mediated contacts. The adenine base inserts into a hydrophobic 
pocket fomied by the side chains of L139, V147, L210, Y212, A213 and L263. The Nl 
nitrogen and amino group of the adenine ring hydrogen bond to the main chain of A213 and 
E21 1. The ribose moiety is anchored to the protein via hydrogen bonding between 3 'OH and 
the carbonyl of E260. The active site of AIK displays a network of ordered water molecules 
that participate in hydrogen bonding within the binding site. Two fully coordinated Mg^^ ions 
are present in the ADC structure. Both metal ions are directly coordinated by the side chain of 
D274 and bridge the a- and p- phosphates of ATPyS. 

[00135] The ATP binding site of protein kinases is a primary target for the design of 
small molecule inhibitors. The ATP binding site appears well conserved among protein 
kinases and involves residues protmding from the pl-p2-p3 sheet, helbc C, the loop region 
linking p5 and the C-lobe, and the catalytic loop. The structure of the ATP binding pocket in 
the AIK- ATPyS complex shows considerable sequence variability with other kinases, which is 
reflective of diversity among kinase sub-families. The ATP binding cleft shows subtle 
differences in ATP site architecture that may be explored to confer specificity of inhibition. 
The position of the bound ATPyS is similar, but not identical, to other structures. 
[00136] In resolving the crystal structure of AIK in complex with ATPyS, applicants 
determined that AIK amino acids in Table 2 (above) are within 4 Angstroms of and therefore 
close, enough to interact with ATPyS. Applicants have also determined that the amino acids of 
Table 3 (above) are within 7 Angstroms of bound ATPyS and therefore are also close enough 
to interact with that substrate or analogs thereof. Further it has been determined that the amino 
acids of Table 4 (above) are within 10 Angstroms of the boimd ATPyS. The 4, 7, and/or 10 
Angstroms sets of amino acids are preferably conserved in variants of AIK. While it is 
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desirable to largely conserve these residues, it should be recognized however that variants may 
also involve varying 1, 2, 3, 4 or more of the residues set forth in Tables 2, 3, and 4 in order to 
evaluate the roles these amino acids play in the binding pocket. 

[00137] With the knowledge of the AIK crystal structure provided herein, Applicants 
define an AIK binding pocket as a binding pocket where the relative positioning of the 4, 7, 
and/or 10 Angstroms sets of amino acids are substantially conserved. Again, it is noted that it 
may be desirable to form variants where 1, 2, 3, 4 or more of the residues set forth in Tables 2, 
3, and 4 are varied in order to evaluate the roles these amino acids play in the binding pocket. 
Accordingly, any set of structure coordinates for a protein from any source having a root mean 
square deviation of non-hydrogen atoms of less than 2 A when superimposed on the non- 
hydrogen atom positions of the corresponding atomic coordinates of Figure 3 for the 4, 7, 
and/or 10 Angstroms sets of amino acids shall be considered identical. As noted previously, 
the root mean square deviation is intended to be limited to only those non-hydrogen atoms of 
amino acid residues that are common to both the protein fragment represented in Figure 3 and 
the protein whose structure coordinates are being compared to the coordinates shown in Figure 
3 since the sequence of the protein may be varied somewhat. 

[00138] Accordingly, in one embodiment, the invention relates to data, computer 
readable media comprising data, and uses of the data where the data comprises the structure 
coordinates shown in Figure 3 or structure coordinates having a root mean square deviation of 
non-hydrogen atoms of less than 3 A when superimposed on the non-hydrogen atom positions 
of the corresponding atomic coordinates of Figure 3 for the 4, 7, and/or 10 Angstroms sets of 
amino acids. 

[00139] Again, it is noted that the root mean square deviation is intended to be limited to 
only those non-hydrogen atoms of amino acid residues that are common to both the protein 
fragment represented in one or more of the tables and the protein whose structure coordinates 
are being compared to the coordinates shown in Figure 3. 

[00140] As noted above, there are many different ways to express the surface contours 
of the AIK structure other than by using the structure coordinates provided in Figure 3. 
Accordingly, it is noted that the present invention is also directed to any data, computer 
readable media comprising data, and uses of the data where the data defines a computer model 
for a protein binding pocket, at least a portion of the computer model having a surface contoiu" 
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that has a root mean square deviation of less than 3 A when superimposed on a sxirface contour 
defined by atomic coordinates of Figure 3, the root mean square deviation being calculated 
based only on non-hydrogen atoms in the structure coordinates of Figure 3 that are present in 
residues shown in Tables 2, 3, and/or 4. 

[00141] Optionally, the root mean square deviation of non-hydrogen atoms is less than 
1.5 A, 1 A, 0.5 A, or less. 

[00142] It will be readily apparent to those of skill in the art that the numbering of amino 
acids in other isoforms of AIK may be different than that set forth for AIK. Corresponding 
amino acids in other isoforms of AIK are easily identified by visual inspection of the amino 
acid sequences or by using commercially available homology software programs, as further 
described below. 

?• System For Displaying the Three Dimensional Structure of AIK 
[00143] The present invention is also directed to machine-readable data storage media 
having data storage material encoded with machine-readable data that comprises structure 
coordinates for AIK. The present invention is also directed to a machine readable data storage 
media having data storage material encoded with machine readable data, which, when read by 
an appropriate machine, can display a three dimensional representation of a structure of AIK. 
[00144] All or a portion of the AIK coordinate data shown in Figure 3, when used in 
conjunction with a computer programmed with software to translate those coordinates into the 
three-dimensional structure of AIK may be used for a variety of purposes, especially for 
purposes relating to dmg discovery. Software for generating three-dimensional graphical 
representations are known and commercially available. The ready use of the coordinate data 
requires that it be stored in a computer-readable format. Thus, in accordance with the present 
invention, data capable of being displayed as the three-dimensional structure of AIK and/or 
portions thereof and/or their structurally similar variants may be stored in a machine-readable 
storage medium, which is capable of displaying a graphical three-dimensional representation of 
the structure. 

[00145] For example, in various embodiments, a computer is provided for producing a 
three-dimensional representation of at least an ADC-like binding pocket, the computer 
comprising: 
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machine readable data storage medium comprising a data storage material encoded with 
machine-readable data, the machine readable data comprising structure coordinates that have a 
root mean square deviation equal to or less than the RMSD value specified in Columns 3, 4 or 
5 of Table 1 when compared to the structure coordinates of Figure 3, the root mean square 
deviation being calculated such that the portion of amino acid residues specified in Column 2 
of Table 1 of each set of structure coordinates are superimposed and the root mean square 
deviation is based only on those amino acid residues in the stmcture coordinates that are also 
present in the portion of the protein specified in specified in Column 1 of Table 1 ; 

a working memory for storing instructions for processing the machine-readable data; 

a central-processing unit coupled to the working memory and to the machine-readable 
data storage medium, for processing the machine-readable data into the three-dimensional 
representation; and 

an output hardware coupled to the central processing unit, for receiving the three 
dimensional representation. 

[00146] Another embodiment of this invention provides a machine-readable data storage 
medium, comprising a data storage material encoded with machine readable data which, when 
used by a machine programmed with instructions for using said data, displays a graphical 
three-dimensional representation comprising AIK or a portion or variant thereof 
[00147] hi various variations, the machine readable data comprises data for representing 
a protein based on stmcture coordinates where the stmcture coordinates have a root mean 
square deviation equal to or less than the RMSD value specified in Columns 3, 4 or 5 of Table 
1 when compared to the stmcture coordinates of Figure 3, the root mean square deviation being 
calculated such that the portion of amino acid residues specified in Column 2 of Table 1 of 
each set of stracture coordinates are superimposed and the root mean square deviation is based 
only on those amino acid residues in the structure coordinates that are also present in the 
portion of the protein specified in specified in Column 1 of Table L 

[00148] According to another embodiment, the machine-readable data storage medium 
comprises a data storage material encoded with a first set of machine readable data which 
comprises the Fourier transform of stmcture coordinates that have a root mean square deviation 
equal to or less than the RMSD value specified in Columns 3, 4 or 5 of Table 1 when 
compared to the stracture coordinates of Figure 3, the root mean square deviation being 
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calculated such that the portion of amino acid residues specified in Column 2 of Table 1 of 
each set of structure coordinates are superimposed and the root mean square deviation is based 
only on those amino acid residues in the structure coordinates that are also present in the 
portion of the protein specified in specified in Column 1 of Table 1, and which, when using a 
machine programmed with instructions for using said data, can be combined with a second set 
of machine readable data comprising the X-ray diffraction pattem of another molecule or 
molecular complex to determine at least a portion of the structure coordinates corresponding to 
the second set of machine readable data. For example, the Fourier transform of the structure 
coordinates set forth in Figure 3 may be used to determine at least a portion of the structure 
coordinates of other AIK-like enzymes, and isoforms of AIK. 

[00149] Optionally, a computer system is provided in combination with the machine- 
readable data storage medium provided herein. In one embodiment, the computer system 
comprises a working memory for storing instructions for processing the machine-readable data; 
a processing unit coupled to the working memory and to the machine-readable data storage 
medium, for processing the machine-readable data into the three-dimensional representation; 
and an output hardware coupled to the processing unit, for receiving the three-dimensional 
representation. 

[00150] Figure 6 illustrates an example of a computer system that may be used in 
combination with storage media according to the present invention. As illustrated, the 
computer system 10 includes a computer 1 1 comprising a central processing unit ("CPU") 20, a 
working memory 22 which may be, e.g., RAM (random-access memory) or "core" memory, 
mass storage memory 24 (such as one or more disk drives or CD-ROM drives), one or more 
cathode-ray tube ("CRT") display terminals 26, one or more keyboards 28, one or more input 
lines 30, and one or more output lines 40, all of which are interconnected by a conventional bi- 
directional system bus 50. 

[00151] Input hardware 36, coupled to computer 11 by input lines 30, may be 
implemented in a variety of ways. For example, machine-readable data of this invention may 
be inputted via the use of a modem or modems 32 connected by a telephone line or dedicated 
data line 34. Alternatively or additionally, the input hardware 36 may comprise CD-ROM 
drives or disk drives 24. In conjunction with display terminal 26, keyboard 28 may also be 
used as an input device. 
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[00152] Conventional devices may, similarly implement output hardware 46, coupled to 
computer 1 1 by output lines 40. By way of example, output hardware 46 may include CRT 
display terminal 26 for displaying a graphical representation of a binding pocket of this 
invention using a program such as MOE as described herein. Output hardware might also 
include a printer 42, so that hard copy output may be produced, or a disk drive 24, to store 
system output for later use. 

[00153] In operation, CPU 20 coordinates the use of the various input and output 
devices 36, 46 coordinates data accesses from mass storage 24 and accesses to and from 
working memory 22, and determines the sequence of data processing steps. A number of 
programs may be used to process the machine-readable data of this invention. Such programs 
are discussed in reference to using the three dimensional structure of AIK described herein. 
[00154] The storage medium encoded with machine-readable data according to the 
present invention can be any conventional data storage device known in the art. For example, 
the storage medium can be a conventional floppy diskette or hard disk. The storage medium 
can also be an optically readable data storage medium, such as a CD-ROM or a DVD-ROM, or 
a rewritable mediimi such as a magneto-optical disk that is optically readable and magneto- 
optically writable. 

8. Uses of the Three Dimensional Structure of AIK 
[00155] The three-dimensional crystal structure of the present invention may be used to 
identify ADC binding sites, be used as a molecular replacement model to solve the structure of 
unknown crystallized proteins, to design mutants having desirable binding properties, and 
ultimately, to design, characterize, identify entities capable of interacting with AIK and other 
structurally similar proteins as well as other uses that would be recognized by one of ordinary 
skill in the art. Such entities may be chemical entities or proteins. The term "chemical entity", 
as used herein, refers to chemical compoimds, complexes of at least two chemical compounds, 
and fragments of such compounds. 

[00156] The AIK stmcture coordinates provided herein are useful for screening and 
identifying drugs that inhibit AIK and other stracturally similar proteins. For example, the 
structure encoded by the data may be computationally evaluated for its ability to associate with 
putative substrates or ligands. Such compounds that associate with ADC may inhibit AIK, and 
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are potential drug candidates. Additionally or alternatively, the structure encoded by the data 
may be displayed in a graphical three-dimensional representation on a computer screen. This 
allows visual inspection of the structure, as well as visual inspection of the structure's 
association with the compounds. 

[001571 Thus, according to another embodiment of the present invention, a method is 
provided for evaluating the potential of an entity to associate with AIK or a fragment or variant 
thereof by using all or a portion of the structure coordinates provided in Figure 3 or functional 
equivalents thereof. A method is also provided for evaluating the potential of an entity to 
associate with ADC or a fragment or variant thereof by using structure coordinates similar to all 
or a portion of the structure coordinates provided in Figure 3 or functional equivalents thereof 
[00158] The method may optionally comprise the steps of: creating a computer model of 
all or a portion of a protein structure (e.g., a binding pocket) using structure coordinates 
according to the present invention; performing a fitting operation between the entity and the 
computer model; and analyzing the results of tiie fitting operation to quantify the association 
between the entity and the model. The portion of the protein structure used optionally 
comprises all of the amino acids listed in Tables 2, 3 and 4 that are present in the structure 
coordinates being used. 

[001591 It is noted that the computer model may not necessarily directly use the 
structure coordinates. Rather, a computer model can be formed that defines a surface contour 
that is the same or similar to the surface contour defmed by the structure coordinates. 
[00160] The structure coordinates provided herein can also be utilized in a method for 
identifying a ligand (e.g., entities capable of associating with a protein) of a protein comprising 
an AIK-like binding pocket. One embodiment of the method comprises: using all or a portion 
of the structure coordinates provided herein to generate a three-dimensional structure of an 
AIK-like binding pocket; employing the three-dimensional structure to design or select a 
potential ligand; synthesizing the potential ligand; and contacting the synthesized potential 
ligand with a protein comprising an AIK-like binding pocket to determine the ability of the 
potential ligand to interact with protein. According to this method, the structure coordinates 
used may have a root mean square deviation equal to or less than the RMSD values specified in 
Columns 3, 4 or 5 of Table 1 when compared to the structure coordinates of Figure 3 according 
to the RMSD calculation method set forth herein, provided that the portion of amino acid 
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residues specified in Column 2 of Table 1 of each set of structure coordinates are superimposed 
and the root mean square deviation is calculated based only on those amino acid residues in the 
structure coordinates that are also present in the portion of the protein specified in Column 1 of 
Table 1. The portion of the protein structure used optionally comprises all of the amino acids 
hsted in Tables 2, 3, and/or 4 that are present. 

[00161] As noted previously, the three-dimensional structure of an ADC-like binding 
pocket need not be generated directly from structure coordinates. Rather, a computer model 
can be formed that defines a surface contour that is the same or similar to the surface contour 
defined by the structure coordinates. 

[00162] A method is also provided for evaluating the ability of an entity, such as a 
compound or a protein to associate with an ADC-like binding pocket, the method comprising: 
constructing a computer model of a binding pocket defined by structure coordinates that have a 
root mean square deviation equal to or less than the RMSD value specified in Columns 3, 4 or 
5 of Table 1 when compared to the structure coordinates of Figure 3, the root mean square 
deviation being calculated such that the portion of amino acid residues specified in Colunm 2 
of Table 1 of each set of structure coordinates are superimposed and the root mean square 
deviation is based only on those amino acid residues in the structure coordinates that are also 
present in the portion of the protein specified in specified in Column 1 of Table 1; selecting an 
entity to be evaluated by a method selected from the group consisting of (i) assembling 
molecular fragments into the entity, (ii) selectmg an entity from a small molecule database, (iii) 
de novo ligand design of the entity, and (iv) modifying a known ligand for ADC, or a portion 
thereof; performing a fitting program operation between computer models of the entity to be 
evaluated and the binding pocket in order to provide an energy-minirnized configuration of the 
entity in the binding pocket; and evaluating the results of the fitting operation to quantify the 
association between the entity and the binding pocket model in order to evaluate the ability of 
the entity to associate with the said binding pocket. 

[00163] The computer model of a binding pocket used in this embodiment need not be 
generated directly from structure coordinates. Rather, a computer model can be formed that 
defines a surface contour that is the same or similar to the surface contour defined by the 
structure coordinates. 
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[00164] Also according to the method, the method may further mclude synthesizing the 
entity; and contacting a protein having an AIK-like binding pocket with the synthesized entity. 
[00165] With the structure provided herein, the present invention for the first time 
permits the use of molecular design techniques to identify, select or design potential inhibitors 
of AIK, based on the structure of an AIK-like binding pocket. Such a predictive model is 
valuable in light of the high costs associated with the preparation and testing of the many 
diverse compounds that may possibly bind to the AIK protein. 

[00166] According to this invention, a potential AIK inhibitor may now be evaluated for 

its ability to bind an AIK-like binding pocket prior to its actual synthesis and testing. If a 
proposed entity is predicted to have insufficient interaction or association with the binding 
pocket, preparation and testing of the entity can be obviated. However, if the computer 
modeling indicates a strong interaction, the entity may then be obtained and tested for its 
ability to bind. 

[00167] A potential inhibitor of an AIK-like binding pocket may be computationally 
evaluated using a series of steps in which chemical entities or fragments are screened and 
selected for their ability to associate with the AIK-like binding pockets. 
[00168] One skilled in the art may use one of several methods to screen entities (whether 
chemical or protein) for their ability to associate with an AIK-like binding pocket. This 
process may begin by visual inspection of, for example, an AIK-like binding pocket on a 
computer screen based on the ADC structure coordinates in Figure 3 or other coordinates which 
define a similar shape generated from the machine-readable storage medium. Selected 
fragments or chemical entities may then be positioned in a variety of orientations, or docked, 
within that binding pocket as delBned above. Docking may be accomplished using software 
such as Quanta and Sybyl, followed by energy minimization and molecular dynamics with 
standard molecular mechanics force fields, such as CHARMM and AMBER. 
[00169] Specialized computer programs may also assist in the process of selecting 
entities. These include: GRID (P. J. Goodford, "A Computational Procedure for Determining 
Energetically Favorable Binding Sites on Biologically Important Macromolecules", J. Med. 
Chem., 28, pp. 849-857 (1985)). GRID is available from Oxford University, Oxford, UK; 
MCSS (A. Miranker et al., "Functionality Maps of Binding Sites: A Multiple Copy 
Simultaneous Search Method." Proteins: Structure, Function and Genetics, 11, pp. 29-34 
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(1991)). MCSS is available from Molecular Simulations, San Diego, Calif.; AUTODOCK (D. 
S. Goodsell et al., "Automated Docking of Substrates to Proteins by Simulated Annealing", 
Proteins: Structure, Function, and Genetics, 8, pp. 195-202 (1990)). AUTODOCK is available 
from Scripps Research Institute, La JoUa, Calif.; & DOCK (1. D. Kuntz et al., "A Geometric 
Approach to Macromolecule-Ligand Interactions", J. Mol. Biol., 161, pp. 269-288 (1982)). 
DOCK is available from University of California, San Francisco, Calif. 
[00170] Once suitable entities have been selected, they can be designed or assembled. 
Assembly may be preceded by visual inspection of the relationship of the fragments to each 
other on the three-dunensional image displayed on a computer screen in relation to the 
structure coordinates of AIK. This may then be followed by manual model building using 
software such as MOE, QUANTA or Sybyl [Tripos Associates, St. Louis, Mo]. 
[00171] Useful programs to aid one of skill in the art in connecting the individual 
chemical entities or fragments include: CAVEAT (P. A. Bartlett et al, "CAVEAT: A Program 
to Facilitate the Structure-Derived Design of Biologically Active Molecules", in "Molecular 
Recognition in Chemical and Biological Problems", Special Pub., Royal Chem. Soc, 78, pp. 
182-196 (1989); G. Lauri and P. A, Bartlett, "CAVEAT: a Program to Facilitate the Design of 
Organic Molecules", J. Comput. Aided Mol Des,, 8, pp. 51-66 (1994)). CAVEAT is available 
from the University of California, Berkeley, Calif.; 3D Database systems such as ISIS (MDL 
Information Systems, San Leandro, Calif.). This area is reviewed in Y. C. Martin, "3D 
Database Searching in Drug Design", J. Med. Chem., 35, pp. 2145-2154 (1992); HOOK (M. B. 
Eisen et al, "HOOK: A Program for Finding Novel Molecular Architectures tiiat Satisfy the 
Chemical and Steric Requirements of a Macromolecule Binding Site", Proteins: Struct., Funct., 
Genet., 19, pp. 199-221 (1994). HOOK is available from Molecular Simulations, San Diego, 
Calif. 

[00172] Instead of proceeding to build an inhibitor of an AIK-like binding pocket in a 
step-wise fashion one fragment or entity at a time as described above, inhibitory or other AIK 
binding compounds may be designed as a whole or ''de novo" using either an empty binding 
site or optionally including some portion(s) of a known inhibitor(s). There are many de novo 
ligand design methods including: LUDI (H.-J. Bohm, "The Computer Program LUDI: A New 
Method for the De Novo Design of Enzyme Inhibitors", J. Comp. Aid. Molec. Design, 6, pp. 
61-78 (1992)). LUDI is available from Molecular Simulations Incorporated, San Diego, Calif.; 
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LEGEND (Y. Nishibata et al, Tetrahedron, 47, p. 8985 (1991)). LEGEND is available from 
Molecular Simulations Incorporated, San Diego, Calif; LEAPFROG (available from Tripos 
Associates, St. Louis, Mo.); & SPROUT (V. Gillet et al, "SPROUT: A Program for Structure 
Generation)", J. Comput. Aided Mol. Design, 7, pp. 127-153 (1993)). SPROUT is available 
from the University of Leeds, UK. 

[00173] Other molecular modeling techniques may also be employed in accordance with 
this invention (see, e.g., Cohen et al., "Molecular Modeling Software and Methods for 
Medicinal Chemistry, J. Med. Chem., 33, pp. 883-894 (1990); see also, M. A. Navia and M. A. 
Murcko, "The Use of Structural Information in Drug Design", Current Opinions in Structural 
Biology, 2, pp. 202-210 (1992); L. M. Balbes et al, "A Perspective of Modem Methods in 
Computer- Aided Drug Design", in Reviews in Computational Chemistry, Vol. 5, K. B. 
Lipkowitz and D. B. Boyd, Eds., VCH, New York, pp. 337-380 (1994); see also, W. C. Guida, 
"Software For Structure-Based Drug Design", Curr. Opin. Struct. Biology, 4, pp. 777-781 
(1994)). 

[00174] Once an entity has been designed or selected, for example, by the above 
methods, the efficiency with which that entity may bind to an AIK binding pocket may be 
tested and optimized by computational evaluation. For example, an effective AIK binding 
pocket inhibitor preferably demonstrates a relatively small difference in energy between its 
bound and free states (i.e., a small deformation energy of binding). Thus, the most efficient 
AIK binding pocket inhibitors should preferably be designed with deformation energy of 
binding of not greater than about 10 kcal/mole, more preferably, not greater than 7 kcal/mole. 
ADC binding pocket inhibitors may interact with the binding pocket in more than one of 
multiple conformations that are similar in overall binding energy. In those cases, the 
deformation energy of binding is taken to be the difference between the energy of the free 
entity and the average energy of the conformations observed when the inhibitor binds to the 
protein, 

[00175] An entity designed or selected as binding to an AIK binding pocket may be 
further computationally optimized so that in its bound state it would preferably lack repulsive 
electrostatic interaction with the target enzyme and with the surrounding water molecules. 
Such non-complementary electrostatic interactions include repulsive charge-charge, dipole- 
dipole and charge-dipole interactions. 
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[00176] Specific computer software is available in the art to evaluate compound 
deformation energy and electrostatic interactions. Examples of programs designed for such 
uses include: Gaussian 94, revision C (M. J. Frisch, Gaussian, Inc., Pittsburgh, Pa. 
COPYRGT.1995); AMBER, version 4.1 (P. A. Kolhnan, University of California at San 
Francisco, COPYRGT 1995); QUANTA/CHARMM (Molecular Simulations, Inc., San Diego, 
Calif. COPYRGT.1995); Insight II/Discover (Molecular Simulations, Inc., San Diego, Calif. 
COPYRGT.1995); DelPhi (Molecular Simulations, Inc., San Diego, Calif. COPYRGT.1995); 
and AMSOL (Quantum Chemistry Program Exchange, Indiana University). These programs 
may be implemented, for instance, using a Silicon Graphics workstation such as an 
Indigo.sup.2 with "IMPACT" graphics. Other hardware systems and software packages will be 
known to those skilled in the art. 

[00177] Another approach provided by this invention, is the computational screening of 
small molecule databases for chemical entities or compounds that can bind in whole, or in part, 
to an AIK binding pocket. In this screening, the quality of fit of such entities to the binding 
site may be judged either by shape complementarities or by estimated interaction energy [E. C. 
Meng et al., J. Comp. Chem., 13, 505-524 (1992)]. 

[00178] According to another embodiment, the invention provides compounds tiiat 
associate with an AIK-like binding pocket produced or identified by various methods set forth 
above. 

[00179] The structure coordinates set forth in Figure 3 can also be used to aid in 
obtaining structural information about another crystallized molecule or molecular complex. 
This may be achieved by any of a number of well-known techniques, including molecular 
replacement. 

[00180] For example, a method is also provided for utilizing molecular replacement to 
obtain stmctural information about a protein whose structure is unknown comprising the steps 
of: generating an X-ray diffraction pattern of a crystal of the protein whose structure is 
unknown; generating a three-dimensional electron density map of the protein whose structure 
is unknown from the X-ray diffraction pattern by using at least a portion of the stmcture 
coordinates set forth in Figure 3 as a molecular replacement model. 

[00181] By using molecular replacement, all or part of the structure coordinates of the 
AIK provided by this invention (and set forth in Figure 3) can be used to determine the 
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structure of another crystallized molecule or molecular complex more quickly and efficiently 
than attempting an ab initio structure determination. One particular use includes use with other 
structurally similar proteins. Molecular replacement provides an accurate estimation of the 
phases for an unknown structure. Phases are a factor in equations used to solve crystal 
structures that cannot be determined directly. Obtaining accurate values for the phases, by 
methods other than molecular replacement, is a time-consuming process that involves iterative 
cycles of approximations and refinements and greatly hinders the solution of crystal structures. 
However, when the crystal structure of a protein containing at least a homologous portion has 
been solved, the phases from the known structure provide a satisfactory estimate of the phases 
for the unknown structure. 

[00182] Thus, this method involves generating a preliminary model of a molecule or 
molecular complex whose structure coordinates are unknown, by orienting and positioning the 
relevant portion of AIK according to Figure 3 within the imit cell of the crystal of the unknown 
molecule or molecular complex so as best to account for the observed X-ray diffraction pattern 
of the crystal of the molecule or molecular complex whose structure is imknown. Phases can 
then be calculated from this model and combined with the observed X-ray diffraction pattem 
amplitudes to generate an electron density map of the structure whose coordinates are 
unknown. This, in turn, can be subjected to any well-known model building and structure 
refinement techniques to provide a final, accurate structure of the unknown crystallized 
molecule or molecular complex [E. Lattman, "Use of the Rotation and Translation Functions^\ 
in Meth. EnzymoL, 115, pp. 55-77 (1985); M. G. Rossmann, ed., "The Molecular Replacement 
Method", Int. Sci. Rev. Ser., No. 13, Gordon & Breach, New York (1972)]. 
[00183] The structure of any portion of any crystallized molecule or molecular complex 
that is sufficiently homologous to any portion of AIK can be resolved by this method 
[00184] In one embodiment, the method of molecular replacement is utilized to obtain 
structural information about the present invention and any other AIK-like molecule. The 
structure coordinates of AIK, as provided by this invention, are particularly usefiil in solving 
the structure of other isoforms of AIK or AIK complexes. 

[00185] The stmcture coordinates of AIK as provided by this invention are useful in 
solving the structure of AIK variants that have amino acid substitutions, additions and/or 
deletions (referred to collectively as "AIK mutants", as compared to naturally occurring AIK). 

44 

Express Mailing No. EV327522975US 



PATENT 



SYR'AIK'5001'Cl 



These AIK mutants may optionally be crystallized in co-complex with a ligand, such as an 
inhibitor, substrate analogue or a suicide substrate. The crystal structures of a series of such 
complexes may then be solved by molecular replacement and compared with that of AIK. 
Potential sites for modification within the various binding sites of the enzyme may thus be 
identified. This information provides an additional tool for determining the most efficient 
binding interactions such as, for example, increased hydrophobic interactions, between AIK 
and a ligand. It is noted that the ligand may be the protein's natural ligand or may be a 
potential agonist or antagonist of a protein, 

[00186] All of the complexes referred to above may be studied using well-known X-ray 
diffraction techniques and may be refined versus 1 .5-3A resolution X-ray data to an R value of 
about 0.22 or less using computer software, such as X-PLOR [Yale University, 
COPYRIGHT. 1992, distributed by Molecular Simulations, Inc.; see, e.g., Blundell & Johnson, 
supra; Meth. EnzymoL, Vol. 114 & 115, H. W. Wyckoff et al., eds., Academic Press (1985)]. 
This information may thus be used to optimize known AIK inhibitors, and more importantly, to 
design new AIK inhibitors. 

[00187] The structure coordinates described above may also be used to derive the 
dihedral angles, phi and psi, that define the conformation of the amino acids in the protein 
backbone. As will be understood by those skilled in the art, the phin angle refers to the rotation 
around the bond between the alpha-carbon and the nitrogen, and the psin angle refers to the 
rotation around the bond between the carbonyl carbon and the alpha-carbon. The subscript "n" 
identifies the amino acid whose conformation is being described [for a general reference, see 
Blimdell and Johnson, Protein Crystallography, Academic Press, London, 1976]. 

9. Uses of the Crystal and Diffraction Pattern of AIK 
[00188] Crystals, crystallization conditions and the diffraction pattern of AIK that can be 
generated fi*om the crystals also have a range of uses. One particular use relates to screening 
entities that are not known ligands of AIK for their ability to bind to AIK. For example, with 
the availability of crystallization conditions, crystals and diffraction pattems of ADC provided 
according to the present invention, it is possible to take a crystal of AIK; expose the crystal to 
one or more entities that may be a ligand of ADC; and determine whether a Ugand/ AIK 
complex is formed. The crystals of ADC may be exposed to potential ligands by various 
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methods, including but not limited to, soaking a crystal in a solution of one or more potential 
ligands or co-crystallizing AIK in the presence of one or more potential ligands. Given the 
structure coordinates provided herein, once a ligand complex is formed, the structure 
coordinates can be used as a model in molecular replacement in order to determine the 
structure of the ligand complex. 

[00189] Once one or more ligands are identified, structural information firom the ligand/ 
AIK complex(es) may be used to design new ligands that bind tighter, bind more specifically, 
have better biological activity or have better safety profile than known ligands. 
[00190] In one embodiment, a method is provided for identifying a ligand that binds to 
AIK comprising: (a) attempting to crystallize a protein that comprises a sequence wherein at 
least a portion of the sequence has 55%, 65%, 75%, 85%, 90%, 95%, 97%, 99% or greater 
identity with SEQ. ID No. 3 in the presence of one or more entities; (b) if crystals of the 
protein are obtained in step (a), obtaining an X-ray diflfraction pattern of the protein crystal; 
and (c) determining whether a ligand/protein complex was formed by comparing an X-ray 
diffraction pattern of a crystal of the protein formed in the absence of the one or more entities 
to the crystal formed in the presence of the one or more entities. 

[00191] In another embodiment, a method is provided for identifying a ligand that binds 
to AIK comprising: soaking a crystal of a protein wherein at least a portion of the protein has 
55%, 65%, 75%, 85%, 90%, 95%, 97%, 99% or greater identity with SEQ. ID No. 3 with one 
or more entities; determining whether a ligand/protein complex was formed by comparing an 
X-ray diffi-action pattern of a crystal of the protein that has not been soaked with the one or 
more entities to the crystal that has been soaked with the one or more entities. 
[00192] Optionally, the method may further comprise converting the diffraction patterns 
into electron density maps using phases of the protein crystal and comparing the electron 
density maps. 

[00193] Libraries of "shape-diverse" compounds may optionally be used to allow direct 
identification of the ligand-receptor complex even when the ligand is exposed as part of a 
mixture. According to this variation, the need for time-consuming de-convolution of a hit from 
the mixture is avoided. More specifically, the calculated electron density function reveals the 
binding event, identifies the bound compound and provides a detailed 3-D stmcture of the 
ligand-receptor complex. Once a hit is foxmd, one may optionally also screen a number of 
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analogs or derivatives of the hit for tighter binding or better biological activity by traditional 
screening methods. The hit and information about the structure of the target may also be used 
to develop analogs or derivatives with tighter binding or better biological activity. It is noted 
that the ligand-AIK complex may optionally be exposed to additional iterations of potential 
ligands so that two or more hits can be linked together to make a more potent ligand. 
Screening for potential ligands by co-crystallization and/or soaking is further described in U.S. 
Patent No. 6,297,021, which is incorporated herein by reference. 

EXAMPLES 
Example 1. Expression and Purification of AIK 
[00194] This example describes the expression of AIK. It should be noted that a variety 
of other expression systems and hosts are also suitable for the expression of AIK, as would be 
readily appreciated by one of skill in the art. 

[00195] The portion of the gene encoding residues 125-391 (from SEQ. ID No. 1) which 
correspond to the catalytic domains of human AIK was isolated from cDNA libraries (testis) 
by PCR and cloned into the BaniR I site of pFastbacHTb (Gibco-BRL). This DNA sequence is 
presented in Figure 1 as SEQ. ID No. 2. 

[00196] Expression in this vector generated a fusion of the kinase domain with a 
cleavable (rTev) N-terminal 6x-histidine tag residues, the amino acid sequence of which is 
shown in Figure 1 as SEQ. ID. 3. Recombinant baculoviruses incorporating the kinase cDNA 
constructs were generated by transposition using the Bac-to-Bac system (Gibco-BRL). High- 
titer viral stocks were generated by infection of Spodoptera frugiperda Sf9 cells and the 
expression of recombinant protein was carried out by infection of Trichoplusia ni Hi5 cells 
(Gibco-BRL) in 5L Wave Bioreactors (Wave Biotech). Recombinant proteins were isolated 
from cellular extracts by passage over ProBond (InVitrogen) resin. It is noted that the 
polyhistidine tags may optionally be removed by treatment with rTEV protease (InVitrogen). 
In this instance, the polyhistidine tag was removed. The AIK protein purity as determined on 
denaturing SDS-PAGE gel was 90-95%. AIK was not phosphorylated during the isolation and 
purification procedures as confirmed by mass spectrometry. AIK was concentrated to a final 
concentration of 9.4mg/ml and stored at 4°C in a buffer containing 50mM TRIS-HCl pH 7.6, 
250mM NaCl, ImM EDTA and ImM DTT. 
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Example 2. Crystallization of AIK-ATP7S complex 
[001971 This example describes the crystallization of AIK-ATPyS complex. It is noted 
that the precise crystallization conditions used may be further varied, for example by 
performing a fine screen based on these crystallization conditions. 

[00198] ADC protein samples were incubated with 2mM ATPyS and 4mM MgCl2 before 
setting crystallization trials. Crystals were obtained after an extensive and broad screen of 
conditions, followed by optimization. Table 2 summarizes effective crystallization conditions 
that were identified. 

[00199] Diffraction quality crystals were grown as in lOOnL sitting droplets using the 
vapor diffusion method. 50nL comprising the AIK-ATP7S complex (9.4mg/ml) was mixed 
with 50nL from a reservoir solution (lOO^iL) comprising O.IM Bicine pH=9.0, 20% PEG 
MME 550 and O.IM NaCL The resulting solution was incubated over a period of one week at 

20°C. 

[00200] Crystals typically appeared after 8-24 hours and grew to a maximum size within 
48 hours. Single crystals were separated from their parent cluster and transferred, briefly, into 
a cryoprotecting solution containing the reservoir solution supplemented with 20% v/v 
ethylene glycol. Crystals were then flash frozen by immersion in liquid nitrogen and then 
stored under liquid nitrogen. A crystal of AIK-ATPyS complex produced as described is 
illustrated in Figure 2. 

[00201] While the present invention is disclosed with reference to certain embodiments 

and examples detailed above, it is to be understood that these embodiments and examples are 
intended to be illustrative rather than limiting, as it is contemplated that modifications will 
readily occur to those skilled in the art, which modifications are intended to be within the scope 
of the invention and the appended claims. All patents, papers, and books cited in this 
application are incorporated herein in their entirety. 
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